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Protozoan Rhomboid Proteins 



The present invention relates to the invasive processes of 
protozoan pathogens and in particular to the provision of 
therapeutic compounds which block these processes and may 
thus be useful in treating protozoan pathogen infection.. 

Protozoan pathogens of the Apicomplexa family, which 
include the malaria parasite P. falciparum, express a 
number of membrane-tethered adhesion proteins that are 
essential for the recognition and binding of host (e.g. 
mammalian) cells. An essential step in the invasive 
processes by which these pathogens enter host cells is the 
release of these adhesion proteins from the surface of the 
pathogen by proteolytic cleavage. Some of . these adhesion 
proteins are cleaved within the transmembrane domain (TMD) 
(Opitz et al (2002) EMBO J. 21 7 1577-1585) . 

The present inventors have discovered that Rhomboid 
polypeptides are involved in the cleavage of protozoan 
adhesion molecules. Rhomboid polypeptides are intra- 
membrane serine proteases, which are widely conserved 
throughout evolution and act on a range of physiological 
substrates. Inhibition of -the protozoan Rhomboid activity 
reduces adhesion polypeptide cleavage and may thus reduce 
the invasiveness of the protozoan pathogen. 

A first aspect of the invention provides a method for 
identifying and/or obtaining a compound that inhibits 
invasiveness of a protozoan pathogen, for example by 
inhibiting the activity of a protozoan Rhomboid 
polypeptide, which method comprises: 
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(a) bringing into contact an isolated Rhomboid polypeptide 
and an isolated substrate polypeptide in the presence of a 
test compound; and 

(b) determining proteolytic cleavage of the substrate 
polypeptide . 

The Rhomboid and substrate polypeptides may be contacted 
out under conditions in which the Rhomboid polypeptide 
normally catalyses proteolytic cleavage of the substrate 
polypeptide . 

The polypeptides may be contacted in a reaction medium in 
an isolated form or may be comprised in a liposome or host 
cell, preferably, a host cell in which the Rhomboid 
polypeptide and substrate are not naturally expressed. The 
Rhomboid polypeptide may, for example, act "on a membrane- 
bound substrate polypeptide to generate a soluble product, 
which is detected. 

Cleavage of the substrate polypeptide may be determined in 
the presence and- absence of test compound. A reduction or 
decrease in cleavage in the presence of the test compound 
relative to the absence of test compound may be indicative 
of the test compound being an inhibitor of protozoan - 
Rhomboid protease activity. Such a compound may inhibit 
adhesive micronemal polypeptide cleavage and thus protozoan 
pathogen infectivity. 

A Rhomboid polypeptide may be a protozoan Rhomboid protein, 
for example a Rhomboid polypeptide from an apicomplexan 
pathogen. Suitable protozoan Rhomboid polypeptides include 
any one of Rhomboids 1-7 of P. falciparum as shown in Table 
la. 
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In some preferred embodiments, the Rhomboid polypeptide may 
be a non-mitochondrial Rhomboid, such as Rhol, Rho3, Rho4 , 
Rho6 or Rho7 of P. falciparum, in particular Rhol, . Rho3, 
5 Rho4 or Rho7 . 

Amino acid residues of Rhomboid and substrate polypeptides 
are described in the present application with reference to 
their position in the Drosophila Rhomboid-1 and Spitz 

10 sequences, respectively. It will be appreciated that the 
equivalent residues in other Rhomboid and substrate 
polypeptides may have a different position and number, . 
because of differences in the amino acid sequence of each 
polypeptide. These differences may occur, for example, 

15 through variations in the length of the N terminal domain. 
Equivalent residues in other Rhomboid and substrate 
polypeptides are easily recognisable by their overall 
' sequence .context and by their positions with respect to the* 
TMDs . 

20 

A polypeptide which is a member of the Rhomboid family 
preferably comprises residues R152, G215, S217 and H281, 
more preferably residues W151, R152, N169, G215, S217 and . 
H281. The presence of these conserved residues may be used 
25 to identify Rhomboid polypeptides in other protozoan 
pathogens . 

Preferably, a Rhomboid polypeptide comprises at least 4 
TMDs , more preferably at least 5 TMDs, with residues N169, 
30 S217 and H281 each occurring in different TMD at about the 
same level in the lipid membrane bilayer.. 
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Preferred Rhomboid polypeptides comprise a GxSx motif, for 
example GxSG. 

A Rhomboid polypeptide may also comprise additional amino 
5 acid residues which are heterologous to the Rhomboid 

sequence. For example, a Rhomboid polypeptide or fragment 
thereof may be included as part of a fusion protein, e.g. 
including a binding portion for a different ligand. 

10 Whilst Rhomboid polypeptides may share relatively low 

overall sequence identity, they are reliably identified by 
bioinf ormatics techniques and manual inspection of key 
residues, as described herein. 

15 A polypeptide which is a member of the Rhomboid family 
(i.e. a Rhomboid polypeptide) may be identified by the 
presence of a Rhomboid homology domain, as defined by the 
PFAM protein structure annotation project (Bateman A. et al 
(2000) The Pf am Protein Families Database Nucl . Acid. Res. 
20 28 263-266). The Pfam rhomboid homology domain is built 
f rpflti a Hidden Markov Model (HMM) using 83 rhomboid 
sequences as a seed. The Pfam ^rhomboid' domain has the 
pfam specific accession number PF01694. 

25 Various other methods suitable for use in identifying 
Rhomboid polypeptides are known in the art. 

Particularly valuable methods include the use of Hidden 
Markov Models built from groups of previously identified 
30 Rhomboid proteins, including, but not limited to Drosophila 
Rhomboids 1-4. Such bio-informatics techniques are . well 
known to those skilled in the art (Eddy S. R. Curr. Opin. 
Struct. Biol. 1996 6(3) 361-365). 
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Rhomboid polypeptides are preferably able to 
proteolytically cleave one or more of Drosophila proteins 
Spitz, Keren and Gurken or protozoan proteins such as Amal 
5 and CTRP, but not similar type I transmembrane proteins 
like TGFa, Delta, EGFR, and TGN38 . 

Suitable substrate polypeptides are Type 1 membrane 
proteins with a single. TMD orientated with an N terminal 

10 extracellular/lumenal domain- A substrate polypeptide may 
comprise a transmembrane domain which has a lumen proximal 
region having one or more of the residues of residues 138- 
144 of the Drosophila Spitz sequence (ASIASGA) , or a 
lumenal region having an equivalent conformation, structure 

15 or three dimensional arrangement to that of residues 138- 
144 of the Drosophila Spitz sequence (ASIASGA) . 

In preferred embodiments ,.. residues 1 to 7 of the TMD of a 
substrate polypeptide are not hydrophobic. 

20 

A substrate polypeptide may comprise a G residue within the 
portion of the TMD proximal to the lumenal or extracellular 
domain of the polypeptide (i.e. between residues 1 and 8 of 
the TMD) , preferably, the G residue being at a position 

25 equivalent to Spitz G143 (i.e. the 6 th residue of the TMD) . 
Preferably, a substrate polypeptide comprises a GA or GG 
motif within the portion of the TMD proximal to the lumenal 
or extracellular domain of the polypeptide (i.e. between 
residues 1 and 8 of the TMD) , preferably at positions 

30 equivalent to G143 and A144 of the Spitz sequence (i.e the 
6 th and 7 th residues of the TMD) . 
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A substrate polypeptide may further comprise a triplet 
motif of small amino acid residues at positions equivalent 
to A138, S139 and 1140 of Spitz (i.e. the 1 st , 2 nd and 3 rd 
residues of the TMD as numbered from the N terminal 
(extracellular) boundary of the TMD) . Small residues 
include Gly, Ala, Ser, He, Leu, Val and Thr. Suitable 
motifs at this position include motifs which have Ala at a 
position equivalent to Spitz A138, such as AGG and ASI . 
The presence of Phe at positions 1, 2 or 3 of the TMD of a 
polypeptide is a counter-indication for the polypeptide 
being a Rhomboid substrate (i.e. such a polypeptide is 
unlikely to be efficiently cleaved by a Rhomboid 
polypeptide) . 

Suitable substrate polypeptides may include protozoan 

adhesive micronemal polypeptides. An adhesive micronemal 

polypeptide may be from a protozoan pathogen, for example 

an Apicomplexan pathogen. Suitable adhesive micronemal 

polypeptides include transmembrane proteins that belong to 

the thrombospondin-related-anonymous proteins (TRAPs) 
family. 

A protozoan adhesive micronemal polypeptide which is a 
suitable substrate may include an AGG or AGL motif at 
residues 1, 2 and 3 of the TMD as described above and a GG 
motif at positions 6 and 7 of the TMD. 

The protozoan pathogen may a member of the Apicomplexa 
family, for example an apicomplexan pathogen selected from 
the group consisting of Plasmodium, Toxoplasma, Eimeria, 
Sarcocystis, Babesia, Isospora, Cyclospora and 
Cryptosporidium, for example P. falciparum, P . bergei, T. 
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gondii, C. parvus, £. tenella, 5* muris, Babesia bovis, 
Cyclospora belli, Theileria annulata or Theileria parva. 

Examples of suitable polypeptides include AMA1, MIC2, MIC6, 
5 MIC8, MIC12, (from T. gondii), AMA1,.TRAP, CTRP (from P. 

falciparum and P. berghei) , MIC1 and MIC4 (from E. tenella) 
or other polypeptide as shown in Table 2. 

.In some preferred embodiments, the substrate polypeptide 
10 may be CTRP or AMA1, for example P. falciparum CTRP or AMA1 

The substrate polypeptide and the Rhomboid polypeptide may 
each comprise an ER (endoplasmic reticulum) retention 
signal. For example, a rhomboid polypeptide may comprise a 
15 C terminal (lumenal) KDEL motif and a substrate may 

comprise a C terminal (cytoplasmic) KKXX motif (Jackson et 
al (1993) J. Cell Biol. 121(2) 317-333) 

The substrate polypeptide may comprise a detectable label, 
20 such as green fluorescent protein (GFP) , luciferase or 
alkaline phosphatase. This is preferably located in the 
soluble extracellular domain, allowing convenient detection 
of the soluble cleaved product and is particularly useful 
in automated assays. 

25 

Methods for obtaining or identifying modulators, in 
particular inhibitors, of protozoan pathogen infectivity 
may be cell-based or non-cell-based. 

30 In non-cell based methods, the rhomboid polypeptide and the 
substrate polypeptide may be isolated or contained in a 
liposome. Liposome based assays may be carried out using 
methods well-known in the art (Brenner C. et al (2000) 
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Meths in Enzymol. 322 243-252, Peters et al (2000) 
Biotechniques 28 1214-1219, Puglielli, H . and Hirschberg C. 
(1999) J. Biol. Chem. 274 35596-35600, Ramj eesingh, M. 
(1999) Meths in Enzymol. 294 227-246). 

Preferably, methods according to the present invention take 
the form of cell based methods. A cell based method may be 
performed in a cell such as a protozoan cell (e.g. a 
Toxoplasma cell), a yeast strain, insect or mammalian cell 
line such as CHO, HeLa and COS cells, in which the relevant 
polypeptides or peptides are expressed from one or more 
vectors introduced into the cell. 

In a preferred embodiment, the Rhomboid polypeptide and the 
substrate polypeptide may be expressed in a host cell from 
heterogeneous encoding nucleic acid. 

Nucleic acid encoding the Rhomboid polypeptide and the 
substrate polypeptide may be contained on a single 
expression vector or on separate expression vectors. 

Persons skilled in the art may vary the precise format of 
methods of the invention using routine skill and knowledge. 
For example, in some embodiments, the cleaved GFP moiety of 
a GFP-substrate fusion may be captured with an anti-GFT 
antibody (Santa Cruz Biologicals) , washed and then the 
captured GFP detected with a polyclonal or monoclonal 
antibody conjugated to an enzyme (capture ELISA) or a 
fluorescent label. 

Some embodiments may employ an ELISA format in which, for 
example, a suitable polyclonal anti-GFP conjugated to 
horse-radish peroxidase or to alkaline phosphatase may be 
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used. Such a conjugate is preferred since the number of 
incubations required is reduced- Alternatively, a 
biotinylated anti-GFP antibody in combination with an 
avidin or streptavidin enzyme conjugate may be used. 

In other embodiments, fluorescence detection may be used, 
for example using a Europium- or Terbium-labelled antibody 
or streptavidin e.g. Delphia or Lance reagents, Perkin 
Elmer) . These labels show a long fluorescence lifetime and 
have improved signal: noise ratio characteristics. 

In other embodiments, GFP may be replaced in the GFP- 
substrate construct with a different enzyme label at the N- 
terminus to give a direct assay for the cleaved substrate 
15 in the medium (or the label may be added to such a 

construct) . Suitable enzymes include Renilla luciferase 
(Lui,J., and Escher, A. (1999) Gene 237, 153-159) and 
secretable alkaline phosphatase sequence (SEAP) (Clontech) . . 

20 It is not necessary to use the entire full-length proteins 
for in vitro or in vivo assays of the invention. 
Polypeptide fragments as described herein which retain the 
activity of the full length protein may be generated and 
used in any suitable way known to those of skill in the 

25 art. Suitable ways of generating fragments include, but 
are not limited to, recombinant expression of a fragment 
from encoding DNA. Such fragments may be generated by 
taking encoding DNA, identifying suitable restriction 
enzyme recognition sites either side of the portion to be 

30 expressed, and cutting out said portion from the DNA. The 
portion may then be operably linked to a suitable promoter 
in a* standard commercially available expression system. 
Another recombinant approach is to amplify the relevant 
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portion of the DNA with suitable PCR primers. Small 
fragments (e.g. up to about 20 or 30 amino acids) may also 
be generated using peptide synthesis methods, which are 
well known in the art. Another approach is to synthesise .a 
nucleic acid comprising all or part of the coding sequence 
from a series of synthetic oligonucleotides. The coding 
sequence may be synthesised using a more optimal genetic 
code for the host cells (Kocken CH et al Infect Immun. 
(2002) 70(8): 4471-6), for example reflecting mammalian 
codon usage for expression in mammalian cells. 

A Rhomboid polypeptide fragment may consist of fewer amino 
acid residues than said full-length polypeptide. Such a 
fragment may consist of 325 amino acids or less, 300 amino 
acids or less, or 275 amino acids or less and/or may 
consist of at least 100 amino acids, more preferably at 
least 150, 200, 250 or 300 amino acids. * A suitable 
fragment may comprise five TMDs . 

Such a fragment preferably comprises residues equivalent to 
R152, G215, S217 and H281, more preferably residues W151, 
R152, N169, G215, S217 and H281 of the Drosophila Rhomboid- 
1 sequence (Acc No: P20350) , which are important for the 
catalytic activity of the protein and are highly conserved 
in the Rhomboid family. 

A substrate polypeptide fragment comprises fewer residues 
than the full-length polypeptide and preferably comprises 
the transmembrane domain of the polypeptide. The TMD may be 
conveniently identified using commercially available 
software such as TMHMM (Krogh A. et al (2001) J. Mol . Biol. 
305 567-580) and TmPred (Hofmann K & Stoffel W (1993) Biol 
Chem. Hoppe Seyler 374 166) . 
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In some embodiments,, a chimeric substrate polypeptide may- 
be used which comprises a substrate polypeptide TMD which 
is cleavable by a Rhomboid polypeptide and heterogeneous 
intra- and extra-cellular domains. 

Combinatorial library technology (Schultz, JS (1996) 
Biotechnol. Prog. 12:729-743) provides an efficient way of 
testing a potentially vast number of different substances 
for ability to modulate activity of a polypeptide- Prior 
to or as well as being screened for modulation of activity, 
test substances may be screened for ability to interact 
with the Rhomboid polypeptide, e.g. in a yeast two-hybrid 
system (which requires- that both the polypeptide and the- 
test substance can be expressed in - yeast from encoding 
nucleic acid) . This may be used as a coarse screen prior 
to testing a substance for actual ability to modulate 
protease activity of the polypeptide. 

The amount of test substance or compound, which may be 
added in a method of the invention, will normally be 
determined by trial and error depending upon the type of 
compound used. Typically, from about 0.01 to lOOyM. 
concentrations of putative inhibitor compound may be used, 
for example from 0.1 to 10]aM. 

Test compounds may be natural or synthetic chemical 
compounds used in drug screening programmes. Extracts of 
plants that contain several characterised or 
uncharacterised components may also be used. 

Methods of the invention may comprise the step of 
identifying the test compound as an inhibitor of adhesive 
micronemal polypeptide cleavage. 



WO 2004/040009 




PCT/GB2003/004711 • 



12 



One class of putative inhibitor compounds can be derived 
from the Rhomboid polypeptide and/or substrate polypeptide 
(e.g. adhesive micronemal polypeptide). Peptide fragments 
of from 5 to 4 0 amino acids, for example, from 6 to 10 
amino acids may be tested for their ability to disrupt such 
interaction or activity. 

The inhibitory properties of a peptide fragment as 
described above may be increased by the addition of one of 
the following groups to the C terminal: chloromethyl 
ketone, aldehyde and boronic acid. These groups are 
transition state analogues for serine, cysteine and 
threonine proteases. The N terminus of a peptide fragment 
may be blocked with carbobenzyl to inhibit aminopeptidases 
and improve stability (Proteolytic Enzymes 2nd Ed, Edited 
by R. Beynon and J. Bond Oxford University Press 2001) . 

Antibodies directed to the site of interaction in the 
protozoan Rhomboid polypeptide or adhesive micronemal 
protein form a further class of putative inhibitor 
compounds. Candidate inhibitor antibodies may be. 
characterised and their binding regions determined to 
provide single chain antibodies and fragments thereof which 
are responsible for disrupting the interaction. 

Antibodies may be obtained using techniques that are 
standard in the art. Methods of producing antibodies 
include immunising a mammal (e.g. mouse, rat, rabbit, 
horse, goat, sheep or monkey) with the protein or a 
fragment thereof. Antibodies may be obtained from 
immunised animals using any of a variety of techniques 
known in the art, and screened, preferably using binding of 
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antibody to antigen of interest. For instance, Western 
blotting techniques or immunoprecipitation may be used 
(Armitage et al . , 1992, Nature 357: 80-82). Isolation of 
antibodies and/or antibody-producing cells from an animal 
5 . may be accompanied by a step of sacrificing the animal. 

As an alternative or supplement to immunising a mammal with 
a peptide, an antibody specific for a protein may be 
obtained from a recombinantly produced library of expressed 

10 immunoglobulin variable domains, e.g. using lambda 

. bacteriophage or filamentous bacteriophage which display 
functional immunoglobulin binding domains on their 
surfaces; for instance see WO92/01047. The library may be 
naive, that is constructed from sequences obtained .from an 

15 organism which has not been immunised with any of the 

proteins (or fragments), or may be one constructed using., 
sequences obtained from an organism which has beien exposed 
to the antigen of interest. 

20 Antibodies according to the present invention may be 

modified in a number of ways. Indeed, the term "antibody" 
should be construed as covering any binding substance 
having a binding domain with the required specificity. 
Thus the invention covers antibody fragments, derivatives, 

25 functional equivalents and homologues of antibodies, 

including synthetic molecules and molecules whose shape 
mimicks that of an antibody enabling it to bind an antigen, 
or epitope. 

30 Example antibody . fragments , capable of binding an antigen 

or other binding partner are the Fab fragment consisting of. 
the VL, VH, CI and CHI domains; the Fd fragment consisting 
of the VH and CHI domains; the Fv fragment consisting of 
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the VL and VH domains of a single arm of an antibody; the 
dAb fragment which consists of a VH domain; isolated CDR 
regions and F(ab')2 fragments, a bivalent fragment 
including two Fab fragments linked by a disulphide bridge 
at the hinge region. Single chain Fv fragments are also 
included. 

The reactivities of antibodies on a sample may be 
determined by any appropriate means. Tagging with 
individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate 
detectable, and preferably measurable, signals. The 
linkage of reporter molecules may be directly or 
indirectly, covalently, e.g. via a peptide bond or non- 
covalently. Linkage via a peptide bond may be as a result 
of recombinant expression of a gene fusion that encodes 
antibody and reporter molecule. The mode of determining 
binding is not a feature of the present invention and those 
skilled in the art are able to choose a suitable mode 
according to their preference and general knowledge. 

Antibodies may also be used in purifying and/or isolating 
Rhomboid and adhesive micronemal polypeptides for use in 
the present methods, for instance following production of 
the polypeptide or peptide by expression from encoding 
nucleic acid therefor. 

Antibodies may be useful in a therapeutic context (which 
may include prophylaxis) to disrupt Rhomboid mediated 
cleavage of adhesive micronemal proteins and thus to reduce 
pathogen invasiveness in the treatment protozoan 
infections, including malaria. 
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Antibodies may also be employed in accordance with the 
present invention for other therapeutic and non-therapeutic 
purposes which are discussed elsewhere herein. 

5 As described above, adhesive micronemal polypeptide 

cleavage is. an essential step in the infection -of host 
cells by protozoan pathogens. Compounds which inhibit 
adhesive micronemal polypeptide cleavage may thus be useful 
.. in inhibiting protozoan pathogen invasiveness. 

10 

Methods of the invention may further comprise the step of 
determining the ability of said test compound to inhibit 
the invasiveness of a protozoan pathogen. 

15 This may be achieved by contacting a host cell with a 

protozoan pathogen in the presence of the test compound 
under conditions in which the pathogen normally infects the 
host cell. 

20 Invasiveness of a protozoan pathogen may be determined, for 
example, in the presence and absence of test compound, in 
tissue culture using hepatocytes or HepG2 cells. 
Alternatively, the ability of a sporozoite to glide on a 
glass slide may be determined. This motility is tightly 

25 linked to invasiveness (Matuschewski K et al EMBO J (2002) 
21 (7) 1597-1606) . 

In other assay systems, sporozoites may be injected into an 
animal model (e.g. rat) to which a test compound is 
30 administered and the extent or amount of infection of 

hepatocytes within the animal determined, and compared to 
control animals (Matuschewski K et al supra) . Determination 
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of hepatocyte infection may involve sacrificing and 
dissecting the animal model. 

A decrease or reduction in the rate of infection in the 
presence relative to the absence of test compound is 
indicative- that the- test compound inhibits infectivity. 

The test compound may further be isolated and/or 
manufactured/synthesised and subsequently formulated into a 
pharmaceutical composition with a pharmaceutical^ 
acceptable excipient, vehicle or carrier. 

It is desirable that compounds for- use in therapeutic 
contexts preferentially or specifically inhibit protozoan 
Rhomboid polypeptide relative to human Rhomboid 
polypeptide. Methods of the invention may include a further 
screen to identify such compounds. 

A method may therefore comprise the further step of; 

bringing into contact an isolated human Rhomboid 
polypeptide and a polypeptide substrate in the presence of 
the test compound; and, 

determining proteolytic cleavage of the substrate by 
the human Rhomboid polypeptide. 

A human Rhomboid polypeptide may be selected from the group 
consisting of Human RHBDL-1 (Human Rhomboid-1 : Pascall and 
Brown (1998) FEES Lett. 429, 337-340), Human RHBDL-2 
(NM__017 821) and Human RHBDL-3. 

Suitable polypeptide substrates for human Rhomboids are 
described above and are cleaved by the Rhomboid polypeptide 
within the transmembrane domain. 
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A suitable substrate polypeptide may comprise a 
transmembrane motif which has one or more residues of the 
Drosophila Spitz ASIASGA motif within the region proximal 
to the lumenal or extracellular domain of the polypeptide 
(i.e. residues 1 to 8 of the TMD starting at the lumenal 
boundary) or may comprise a transmembrane motif which none 
of the residues of the Drosophila Spitz ASIASGA motif, but 
which instead possess. a motif having an equivalent 
structure which is cleaved by Rhomboid polypeptide (e.g. 
. Gurken or other sequence shown in Figure 1). Preferably a 
substrate polypeptide comprises one or both of motifs 
described above in the region proximal to the lumenal or 
extracellular domain of the polypeptide. 

Following identification, of a compound using a method of 
the invention described herein, a method may further 
comprise modifying the compound to optimise the 
pharmaceutical properties thereof. 

The modification of a 'lead' compound identified as 
biologically active is a known approach to the development 
of pharmaceuticals and may be desirable where the active 
compound is difficult or expensive to synthesise or where 
it is unsuitable for a particular method of administration, 
e.g. peptides are not well suited as active agents for oral 
compositions as they tend- to be quickly degraded by 
proteases in the alimentary canal. Modification of a known 
active compound (for example, to produce a mimetic) may be 
used to avoid randomly screening large number of molecules 
for a target property. 
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Modification of a 'lead' compound to optimise its 
pharmaceutical properties commonly comprises several steps 
Firstly, the particular parts of the compound that are 
critical and/or important in determining the target 
property are determined. In the case of a peptide, this 
can be done by systematically varying the amino acid 
residues in the peptide, e.g. by substituting each residue 
in turn. These parts or residues constituting the active 
region of the compound are known as its "pharmacophore". 

Once the pharmacophore has been found, its structure is 
modelled according to its physical properties, e.g. 
stereochemistry, bonding, size and/or charge, using data 
from a range of sources, e.g. spectroscopic techniques, X- 
ray diffraction data and NMR. 

Computational analysis, similarity mapping (which models 
the charge and/or volume of a pharmacophore, rather than 
the bonding between atoms) and other techniques can be used 
in this modelling process. 

In a variant of this approach, the three-dimensional 
structure of the ligand and its binding partner are 
modelled. This can be especially useful where the ligand 
and/or binding partner change conformation on binding, 
allowing the model to take account of this in the 
optimisation of the lead compound. 

A template molecule is then selected onto which chemical 
groups that mimic the pharmacophore can be grafted. The 
template molecule and the chemical groups grafted on to it 
can conveniently be selected so that the modified compound 
is easy to synthesise, is likely to be pharmacologically 
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acceptable, and does not degrade in vivo, while retaining 
the biological activity of the. lead compound. The modified 
compounds found by this approach can then be screened to 
see whether they have the target property, or to what 
5 extent they exhibit it. Modified compounds include 
mimetics of the lead compound. 

Further optimisation or modification can then be carried 
out to arrive at one or more final compounds for in vivo or 
10 cLinical testing. 

Rhomboid and adhesive miconemal polypeptides may also be 
used in methods of designing mimetics which are suitable 
for inhibiting protozoan pathogen infectivity. 

15 

The present invention provides a method of designing 
mimetics having the biological activity of inhibiting the 
Rhomboid mediated cleavage of adhesive miconemal r 
polypeptides and thus inhibiting protozoan pathogen 
20 invasiveness, said method comprising: 

(i) analysing a compound having the biological activity to 
determine the amino acid residues essential and important 
for the activity to define a pharmacophore; and, 

(ii) modelling the pharmacophore to design and/or screen 
25 candidate mimetics having the biological activity. 

A suitable compound may be, for example, a protozoan 
Rhomboid polypeptide or fragment as described herein. 

30 Suitable modelling techniques are known in the art. This 
- includes the design of so-called "mimetics" which involves 
the study of the functional interactions of the molecules 
and the design of compounds which contain functional groups 
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arranged in such a manner that they could reproduced those 
interactions. 

The modelling and modification of a ^lead' compound to 
optimise its properties, including the production of 
mimetics, is described above. 

The activity or function of a protozoan Rhomboid 
polypeptide may be inhibited, as noted, by means of a 
compound that interferes in some way with the interaction 
of protozoan Rhomboid with adhesive micronemal polypeptides 
or other suitable substrate polypeptides. An alternative 
approach to inhibition employs regulation at the nucleic 
acid level to inhibit activity or function by down- 
regulating production of protozoan Rhomboid. 

For instance, expression of a gene may be inhibited using 
anti-sense or RNAi technology. The use of these approaches 
to down-regulate gene expression is now well-established in 
the art. 

Anti-sense- oligonucleotides may be designed to hybridise to 
the complementary sequence of nucleic acid, pre-mRNA or 
mature mRNA, interfering with the production of Rhomboid 
polypeptide so that its expression is reduced or completely 
or substantially completely prevented. In addition to 
targeting coding sequence, antisense techniques may be used 
to target control sequences of a gene, e.g. in the 5' 
flanking sequence, whereby the antisense oligonucleotides 
can interfere with expression control sequences. The 
construction of antisense sequences and their use is 
described for example in Peyman and Ulman, Chemical 
Reviews, 90:543-584, (1990) and Crooke, Ann. Rev. 
Pharmacol. Toxicol., 32:329-376, (1992). 
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Oligonucleotides may be generated in vitro or ex vivo for 
administration or anti-sense RNA may be generated in vivo 
within cells in which down-regulation is desired. Thus, 
double-stranded DNA may be placed under the control of a 
5 promoter in a "reverse orientation" such that transcription 
of the anti-sense strand of the DNA yields RNA which is 
complementary to normal mRNA transcribed from the s.ense 
. strand of the target gene. The complementary anti-sense 
RNA sequence is thought then to bind with mRNA to form a 
10 duplex, inhibiting translation of the endogenous mRNA from 
the target gene into protein. Whether or not this is the 
actual mode of action is still uncertain. However, it is 
established fact that the technique works. 

The complete sequence corresponding to the coding sequence 
15 in reverse orientation need not be used. For example 
fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
fragments of various sizes and from various parts of the 
coding or flanking sequences of a gene to optimise the 
20 level of anti-sense inhibition. It may be advantageous to 
include the initiating methionine ATG codon, and perhaps 
one or more nucleotides upstream of the initiating codon. 
A suitable fragment may have about 14-23 nucleotides, e.g. 
about 15, 16 or 17 . 

25 An alternative to anti-sense is to use a copy of all or 
part of the target gene inserted in sense, that is the 
same, orientation as the target gene, to achieve reduction 
in expression of the target gene by co-suppression; Angell 
& Baulcombe (1997) The EMBO Journal 16,12:3675-3684; and 

30 Voinnet & Baulcombe (1997) Nature 389: pg 553). Double 
stranded RNA (dsRNA) has been found to be even more 
effective in gene silencing than both sense or antisense 
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strands alone (Fire A. et al Nature, 391, (1998)) and has 
been used effectively in Plasmodium (Malhotra P et al Mol 
Microb. (2002) 45(5) 1245-1254; McRobert L et al Mol. 
Biochem. Parasitol. (2002) 119(2) 273-278). dsRNA mediated 
silencing is gene specific and is often termed RNA 
interference (RNAi) . 

RNA interference is a two step process. First, dsRNA is 
cleaved within the cell to yield short interfering RNAs 
(siRNAs) of about 2i-23nt length with 5' terminal phosphate 
and 3' short overhangs (~2nt) . The siRNAs target the 
corresponding mRNA sequence specifically for destruction 
(Zamore P.O. Nature Structural Biology, 8, 9,-746-750 
(2001) 

RNAi may be also be efficiently induced using chemically 
synthesized siRNA duplexes of the same structure with 3'- 
overhang ends (Zamore PD et al Cell, 101, 25-33, (2000)) 
Synthetic si RNA duplexes have been shown to specifically 
suppress expression of endogenous and heterologeous genes 
in a wide range of mammalian cell lines (Elbashir SM. et 
al. Nature, 411, 494-498, (2001)). 

Another possibility is that nucleic acid is used which on 
transcription produces a ribozyme, able to cut nucleic acid 
at a specific site - thus also useful in influencing gene 
expression. Background references for ribozymes include 
Kashani-Sabet and Scanlon, 1995, Cancer Gene Therapy, 2(3)- 
213-223, and Mercola and Cohen, 1995, Cancer Gene Therapy 
2(1), 47-59. PY ' 

Thus, a modulator of protozoan Rhomboid activity and thus a 
modulator of protozoan pathogen infectivity may comprise a 
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nucleic acid molecule comprising all or part of a Rhomboid 
coding sequence shown in Table 1 or the complement thereof 

Such a molecule may suppress the expression of protozoan 
5 Rhomboid polypeptide and may comprise a sense or anti-sense 
Rhomboid coding sequence or may be a Rhomboid specific 
ribozyme, according to the type of suppression to be 
employed - 

10 The type of suppression will also determine whether the 

molecule is double or single stranded and whether it is RNA 
or DNA. 

Another aspect of the present invention provides a method 
15 - of producing a pharmaceutical composition comprising; 

identifying a compound which inhibits the invasiveness 
of a protozoan pathogen using a method described herein; 
and, 

admixing the compound identified thereby with a 
20 pharmaceutical^ acceptable carrier. 

As described above, a method of the invention may comprise 
the step of modifying the compound to optimise the 
pharmaceutical properties thereof. 

25 

Another aspect of the invention provides a method for 
preparing a pharmaceutical composition for treating a 
protozoan pathogen infection comprising; 

i) identifying a compound which modulates the 
30 proteolytic activity of a Rhomboid polypeptide,' 

ii) synthesising the identified compound, and; 

iii) incorporating the compound into a pharmaceutical 
composition. 
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The identified compound may be synthesised using 
conventional chemical synthesis methodologies- Methods for 
the development and optimisation of synthetic routes are 
well known to a skilled person. 

The compound may be modified and/or optimised as described 
above . 

Incorporating the compound into a pharmaceutical 
composition may include admixing the synthesised compound 
with a pharmaceutically acceptable carrier or excipient. 

Another aspect of the invention provides a compound which 
modulates protozoan pathogen infectivity obtained by a 
method as described herein. Such a compound may comprise or 
consist of a peptide fragment of a protozoan Rhomboid 
polypeptide - 

Another aspect of the invention provides a pharmaceutical 
composition comprising a compound which modulates the 
proteolytic activity of a Rhomboid polypeptide obtained by 
a method described herein!. 

In other aspects the invention provides the use of a 
compound obtained by a method described herein in the 
manufacture of a composition for treatment of a protozoan 
pathogen infection and a method comprising administration 
of a composition obtained by a method described herein to a 
patient for treatment of a protozoan pathogen infection. 

Protozoan pathogens are described above. Disorders 
associated with infection with a protozoan pathogen include 
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malaria, toxoplasmosis, cryptosporidosis , diarrhoea 
associated with Isospora belli or Cyclospora cayetanis 
infection, and various livestock disorders associated with 
Eimeria infection. 

. Whether it is a polypeptide, antibody, peptide, anti-sense, 
sense or siRNA nucleic acid molecule, small molecule, 
mimetic or other pharmaceutically useful compound according 
to the present invention that is to be given to an 

10 individual, .administration is preferably in a 

"prophylactically effective amount" or a "therapeutically 
effective amount" (as the case may be, although prophylaxis 
may be considered therapy) , this being sufficient to show 
benefit to the individual. The actual amount administered, 

15 and rate and time-course of administration, will depend on 
the nature and severity of what is being treated. 
Prescription of treatment, e.g. decisions on dosage etc, is 
within the responsibility of general practitioners and 
other medical doctors. 

20 

A composition may be administered alone or in combination 
with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated. 

25 Pharmaceutical compositions according to the present 
invention, and for use in accordance with the present 
invention, may include, in addition to active ingredient, a 
pharmaceutically acceptable excipient, carrier, buffer, 
stabiliser or other materials well known to those skilled 

30 in the art. Such materials should be non-toxic and should 
• not interfere with the efficacy of the active ingredient. 
The precise nature of the carrier or other material will 
depend on the route of administration, which may be oral, 
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or by injection, e.g. cutaneous, subcutaneous or 
intravenous . 

Pharmaceutical compositions for oral administration may be 
in tablet, capsule, powder or liquid form. A tablet may 
include a solid carrier such as gelatin or an adjuvant. 
Liquid pharmaceutical compositions generally include a 
liquid carrier such as water, petroleum, animal or 
vegetable oils, mineral oil or synthetic oil. 
Physiological saline solution, dextrose or other saccharide 
solution or glycols such- as ethylene glycol, propylene 
glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or 
injection at the site of affliction, the active ingredient 
will be in the form of a parenterally acceptable aqueous 
solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the 
art are well able to prepare suitable solutions using, for 
example, isotonic vehicles such as Sodium Chloride 
Injection, Ringer's Injection, Lactated. Ringer 1 s Injection. 
Preservatives, stabilisers, buffers, antioxidants and/or 
other additives may be included, as required. 

Liposomes, particularly cationic liposomes, may be used in 
carrier formulations . 

Examples of techniques and protocols mentioned above can be 
found in Remington's Pharmaceutical Sciences, 16th edition, 
Osol, A. (ed), 1980. 

Another aspect of the invention provides a method of 
identifying a protozoan Rhomboid polypeptide comprising, 
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(a) providing a test Rhomboid polypeptide, 

(b) bringing into contact a substrate polypeptide and 
the test Rhomboid polypeptide under conditions in which the 
substrate polypeptide is normally proteolytically cleaved; 

5 and, 

(c) determining cleavage of the substrate 
polypeptide . 

A suitable test Rhomboid polypeptide may comprise an amino 
10 acid sequence encoded by a nucleic acid sequence shown in 
Table 1. Screening databases may identify other suitable 
test Rhomboid polypeptides, in particular, databases of 
protozoan nucleic acid sequence, using the bioinf ormatics 
techniques discussed above. 

15 

Suitable substrate polypeptides include protozoan adhesive, 
micronemal polypeptides. A suitable adhesive micronemal 
polypeptide may belong to the thrombospondin-related- ':r 
anonymous proteins (TRAPs) family. Examples of suitable 
20 polypeptides include AMA1, MIC2, MIC6, MIC8, MIC12, (from 

T. gondii), TRAP, CTRP (from P. falciparum and P. berghei) , 
MIC1 and MIC4 (from E. tenella) or fragments thereof which 
comprise the transmembrane domain. In particular, CTRP 
and/or AMA1 may be used. 

25 

Other aspects of the invention relate to protozoan Rhomboid 
polypeptides and encoding nucleic acids. 

An aspect of the invention provides a protozoan Rhomboid 
polypeptide which proteolytically cleaves the transmembrane 
30 domain of an adhesive micronemal polypeptide. 
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Such a polypeptide may have a sequence encoded by a nucleic 
acid sequence shown in Table 1 (for example Pf Rhol-7) or 
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may be a fragment of such a sequence which consists of 
fewer residues than the full-length protozoan Rhomboid 
polypeptide. 

The KDEL ER retention sequence is not found in natural 
protozoan Rhomboid polypeptides and directs the expressed 
Rhomboid polypeptide to be retained the ER (endoplasmic 
reticulum) rather than the miconemes of the protozoan cell. 
As described below, Rhomboid polypeptides labelled with an 
ER retention sequence such as KDEL may be particularly 
useful in certain embodiments of the invention, as 
proteolyic cleavage by such polypeptides avoids potential 
problems with variations in secretion efficiency. 

An aspect of the invention thus provides an isolated 
protozoan Rhomboid polypeptide as described herein 
comprising a C terminal ER retention sequence, A suitable 
retention sequence consists of the amino acid sequence 
KDEL. 

Another aspect of the invention provides an isolated 
nucleic acid encoding a protozoan Rhomboid polypeptide as 
described above. 

The coding sequence may be a nucleic acid sequence listed 
in Table 1 or it may be a mutant, variant, derivative or 
allele of a sequence listed. The sequence may differ from 
a sequence of Table 1 by a change, which is one or more of 
addition, insertion, deletion and substitution of one or 
more nucleotides of the sequence shown. Changes to a 
nucleotide sequence may result in an amino acid change at 
the protein level, or not, as determined by the genetic 
code . 
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Thus, nucleic acid according to the present invention may 
include a sequence different from a sequence listed in 
Table 1 yet encode. a polypeptide with the same amino acid 
5 sequence.' As described above, codon usage may be adjusted 
in order to express an amino acid sequence in a particular 
host system, such as a mammalian cell. 

v;An isolated nucleic acid may share greater than about 10% 
10 sequence identity with a nucleic acid sequence of Table 1 
greater than 20%, greater than 30%, greater than 40%, 
greater than 50%, greater than 60%, greater than about 70%, 
greater than about 80%, greater than about 90%, or greater 
than about 95%. 

15 

The present invention also extends to nucleic acid that 
hybridizes with a sequence listed in Table 1 under . 
.stringent/conditions . Suitable conditions include, e.g. 
for sequences that are about 80-90% identical; 

20 hybridisation overnight at 42°C in 0.25M Na 2 HP0 4 , pH 7.2, 
6.5% SDS, 10% dextran sulfate and a final wash at 55°C in 
0.1 X SSC, 0.1% SDS. For sequences that are, greater than 
about 90% identical, suitable conditions include, 
hybridization overnight at 65°C in'0.25M Na 2 HPC>4, pH 7.2, 

25 6.5% SDS, 10% dextran sulfate and a final wash at 60°C in 
0..1X SSC, 0.1% SDS. 

A convenient way of producing a polypeptide for use in 
assays and methods according to the present invention is to 
30 express nucleic acid encoding it, by use of the nucleic 
acid in an expression system. Accordingly, the present 
invention also encompasses a .method of making a polypeptide 
(as disclosed) , the method including expression from 



WO 2004/040009 



30 



PCT/GB2003/004711 



nucleic acid encoding the polypeptide and testing for 
Rhomboid protease activity. This may conveniently be 
achieved by growing a host cell in culture, containing such 
a vector, under appropriate conditions that cause or allow 
expression of the polypeptide. Polypeptides may also be 
expressed in in vitro systems, such as reticulocyte lysate. 

Another aspect of the present invention therefore provides 
a method of producing a Rhomboid polypeptide comprising: 

(a) causing expression from nucleic acid which 
encodes a protozoan Rhomboid polypeptide in a suitable 
expression system to produce the polypeptide recombinantly; 

(b) testing the recombinantly produced polypeptide 
for Rhomboid protease activity. 

Suitable nucleic acid sequences include a nucleic acid 
sequence encoding a protozoan rhomboid polypeptide a 
mutant, variant or allele thereof as described herein. 

A polypeptide may be isolated and/or purified (e.g. using 
an antibody) for instance after production by expression 
from encoding nucleic acid (for which see below) . Thus, a 
polypeptide may be provided free or substantially free from 
contaminants with which it is naturally associated (if it 
is a naturally-occurring polypeptide) . A polypeptide may 
be provided free or substantially free of other 
polypeptides . 

The recombinantly produced polypeptide may be isolated 
and/or tested for Rhomboid protease activity by 
determination of the cleavage of a substrate polypeptide 
such as an adhesive micronemal polypeptide upon incubation 
of the polypeptide with the substrate polypeptide. 
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An isolated nucleic acid as described herein, for example a 
nucleic acid encoding a protozoan Rhomboid polypeptide, may 
be comprised in a vector. Such a vector may further include 
a nucleic acid sequence encoding a substrate polypeptide 
such as a protozoan adhesive micronemal protein. Suitable 
vectors can be chosen or constructed, containing 
appropriate regulatory sequences, including promoter 
sequences, terminator fragments, polyadenylation sequences, 
enhancer sequences, marker genes and other sequences as 
appropriate. Vectors may be plasmids, viral e.g. 'phage, 
or phagemid, as appropriate. For further details see, for 
example, Molecular Cloning: a Laboratory Manual: 2nd 
edition, Sambrook et al . , 1989, Cold Spring Harbor 
Laboratory Press. Many known techniques and protocols for 
manipulation of nucleic acid, for example in preparation of 
nucleic acid constructs, mutagenesis, sequencing, 
introduction of DNA into cells and gene expression, and 
analysis of proteins, are described in detail in Current 
Protocols in Molecular Biology, Ausubel et al. eds . , John 
Wiley & Sons, 1992. 

Systems for cloning and expression of a polypeptide in a 
variety of different host cells are well known. Suitable 
host cells include bacteria, eukaryotic cells such as 
mammalian and yeast, baculovirus and protozoan systems. 
Mammalian cell lines available in the art for expression of 
a heterologous polypeptide include Chinese hamster ovary 
cells, HeLa cells, baby hamster kidney cells, COS cells and 
many others. A common, preferred bacterial host is E . 
coll . 
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Further aspects of the invention provide a host cell 
containing heterologous nucleic acid encoding a protozoan 
Rhomboid polypeptide, including a Rhomboid polypeptide 
which has a KDEL tag or which is a fragment of a full 
length Rhomboid sequence, and a host cell containing 
heterologous nucleic acid encoding a protozoan Rhomboid 
polypeptide and a substrate polypeptide, for example a 
protozoan adhesive micronemal polypeptide. Nucleic acid 
encoding the protozoan Rhomboid polypeptide and substrate 
polypeptide may be present on a single nucleic acid 
construct or vector within the host cell or nucleic acid 
encoding the two polypeptides may be present on separate 
constructs or vectors. 

The nucleic acid may be integrated into the genome (e.g. 
chromosome) of the host cell. Integration may be promoted 
by inclusion of sequences that promote recombination with 
the genome, in accordance with standard techniques. The 
nucleic acid may be on an extra-chromosomal vector within 
the cell. 

The introduction of nucleic acid into a host cell, which, 
may (particularly for in vitro introduction) be generally 
referred to without limitation as "transformation", may 
employ any available technique. For eukaryotic cells, 
suitable techniques may include calcium phosphate 
transfection, DEAE-Dextran, electroporation, liposome- 
mediated transfection and transduction using retrovirus or 
other virus, e.g. vaccinia or, for insect cells, 
baculovirus. For bacterial cells, suitable techniques may 
include calcium chloride transformation, electroporation 
and transfection using bacteriophage. 
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Marker genes such as antibiotic resistance or sensitivity 
genes may be used in identifying clones containing nucleic 
acid of interest, as is well known in the art. 

5 The introduction may be followed by causing or allowing 
expression from the nucleic acid, e.g. by culturing host 
cells (which may include cells actually transformed 
although more likely the cells will be descendants of the 
'transformed cells), under conditions for expression of the 
10 gene, so that the encoded polypeptide is produced. 

A Rhomboid polypeptide may be co-expressed in a host cell, 
with a substrate polypeptide, such as a protozoan adhesive 
micronemal polypeptide, and the Rhomboid serine protease 
15 activity determined by determining cleavage of the 

substrate polypeptide. Cleavage may be determined by 
determining the presence or absence of soluble cleavage 
products which may be secreted into the culture medium. >a 

20 Aspects of the present invention will now be illustrated 
with reference to the following experimental 
exemplification, by way of example and not limitation. 
Further aspects and embodiments will be apparent to those 
of ordinary skill in. the art. All documents mentioned in 

25 this specification are hereby incorporated herein by 
reference. 

Figure 1 .shows an alignment of transmembrane domains of 
micronemal proteins from apicomplexan species with 
30 Drosophila Spitz and Keren. All are single-pass type 1 

transmembrane proteins. Black lines indicate the predicted 
TMD regions: the line above the sequences predicts the 
micronemal protein TMDs; the line below predicts the TMDs 
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of Spitz and Keren. The GA or GG motif approximately seven 
residues into the TMD (double underline) and the conserved 
small residues near the luminal/extracellular face of the 
TMD (single underline) are essential for rhomboid cleavage 
5 of Spitz and are underscored. Note also the conserved 

tyrosine (Y) at the cytoplasmic face of the TMDs. Accession 
numbers of the sequences are shown in Table 2 . 

Table la shows examples of Plasmodium falciparum rhomboids ■ 
10 which were initially identified by the Pfam motif-finding 
algorithm, with a score of greater than 10. Four 
annotations are listed: sequencing consortium annotation 
(Sanger) , plus 3 automatic gene prediction algorithms, 
FullPhat, GlimmerM and Genefinder. The approximate position 
15 on the chromosome is also provided along with the predicted 
sequence around the rhomboid active site. Rhomboids 2 and 5 
are predicted to be mitochondrial, based on the existence 
of a mitochondrial targeting sequence predicted by MitoProt 
or Predotar. These genes represent the most reliable 
20 prediction of true rhomboids in the . published P. falciparum 
genome sequence as- predicted by the Pfam motif-finding 
algorithm, with a score of greater than 10. 

Table lb shows an example of a candidate Toxoplasma gondii 
25 rhomboid (ref : http : //ParaDB . cis . upenn . edu/toxo/index . html ) 

Table 2 shows examples of substrates for rhomboid proteases 
from apicomplexan species. 
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Experimental 

Identification of Plasmodium Rhomboid polypeptides . 
The published Plasmodium genome sequence was searched for 
Rhomboids polypeptides using PFAM as described above, 
selecting scores higher than 10, followed by' visual 
inspection to identify key Rhomboid residues. 

A number of putative Rhomboids were identified using PFAM . 
These were further analysed for the presence of residues 
equivalent to residues N169, G215, S217 and H281 of 
Drosophila Rhomboid-1, which are required for catalytic 
activity. Candidates identified as being within the 
subclass of mitochondrial rhomboids were also excluded by 
prediction of the existence of a mitohondrial targeting 
sequence with PREDOTAR algorithm (www.inra.fr/predotar) or 
MitoProt (Claros M . > et al Eur. J. Biochem. (1996) 241 779- 
786) 

Rhomboid polypeptides identified by this approach are 
listed in Table 1. 

Cloning of Plasmodium Rhomboid 

Rhomboid coding sequences were amplified by PCR from 
Plasmodium cDNA using the protocols set out in Sambrook & 
Russell Molecular Cloning, A Laboratory Manual, 3 rd 
Edition, Cold Spring Harbor Laboratory Press, 2001, and : 
Ausubel et al, Short Protocols in Molecular Biology, John 
Wiley and Sons, 1992. The amplified sequences were then 
inserted into a pcDNA3.1 (Invitrogen) . 

The Pf Rho 1, 3, 4 and 7 genes were resynthesised (Geneart, 
DE) with codon usage adjusted as described in Kocken et al 
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(2002) Infect Immun 70(8) 4471-4476 to optimise expression 
in mammalian cell lines. 

Expression of Pf rhomboids -1, -3 and -7 was observed in- 
COS cells. 

Cloning of adhesive micronemal proteins of Plasmodium 
Oligonucleotide primers for amplification of the adhesive 
micronemal proteins Ama-1 and CTRP of Plasmodium were 
designed based on the published sequences using 
conventional primer design software. 

The coding sequences were amplified using PCR in accordance 
with standard techniques, and cloned into a standard 
mammalian expression vector for expression in cell culture. 
The coding sequences incorporated a triple-HA tag to 
facilitate detection. Expression was detected by 
conventional Western blotting techniques using available 
anti-HA antibodies (Roche). 

Plasmodium Rhomboid Activity 

Plasmodium rhomboid activity was determined using published 
methods described in Urban et al Cell (2001) 107(2): 173-82 
and WO02/093177. 

Briefly, Pf rhomboids-1 and -3 were co-expressed with GFP- 
tagged Spitz under control of a CMV promoter; as follows; 
COS cells were grown in DMEM medium (supplemented with 10% 
foetal calf serum) , and transfected with FuGENE 6 
Transfection Reagent (Roche) . 

Cells were co-transf ected in 35mm culture wells with 25- 
250ng of (a) rhomboid construct comprising the Pf -rhomboid 
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-1 or -3 coding sequence inserted into the vector pcDNA 
3.1-1- (Invitrogen) , and (b) substrate construct comprising 
the GFP-Spitz coding sequence inserted into pcDNA 3.1 + 
( Invitrogen) . 

5 

Empty plasmid was used to bring the total DNA to lyg per 
well . 

Construct (b) was transfected into COS cells in the absence 
10 of construct (a) as a control for endogenous cleavage of 
the substrate. 

24-30 hours post-transf ection, the medium was replaced with 
serum-free medium; this was harvested 24 hours later and 
15 cells were lysed in SDS-sample buffer. 

For some experiments, the serum-free medium was 
supplemented with the metalloprotease inhibitor batimastat 
(British Biotech) or ilomostat (Calbiochem) to minimize 
20 endogenous substrate cleavage in the assay. For inhibitor 
assays, a test compound may be included in the serum-free 
medium. 

Spitz cleavage was assayed by detecting the accumulation of 
25 the GFP-tagged Spitz extracellular domain in the medium by 
standard western blotting techniques, using an anti-GFP 
antibody (Santa Cruz Biologicals) . 

Parallel experiments were also performed using a C- 
30 terminally tagged form of Spitz. Cleavage of the C-terminal 
. tagged Spitz was assayed by detecting the intracellular 
cleaved product in western blots of cell extracts. 
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Both Pf rhomboid-1 and -3 were observed to show specific 
cleavage of Spitz, which was not inhibited by batimastat. 
Batimastat is a potent inhibitor of metalloproteases that 
can cause non-specific shedding of cell surface proteins. 
These results show that both Pf rhomboid-1 and -3 are 
active rhomboid proteases that can recognise 'Spitz-like' 
TMDs . 

Cleavage of Pf adhesion protein Ama-1 

Pf adhesion protein Ama-1 was recoded with a mammalian 
codon usage profile to allow for expression in mammalian 
cells. 

Cleavage of Ama-1 by Drosophila Rhomboid-1 was determined 
using the method set out above. Briefly, Ama-1 and 
Drosophila Rhomboid-1 were co-expressed as described above 
in COS cells using the CMV promoter for both constructs. 
Standard western blotting techniques using an ant i -Ama-1 
antibody were employed to detect the accumulation of the 
extracellular domain of cleaved Ama-1 in the COS cell 
medium. 

Efficient cleavage of Ama-1 by Drosophila Rhomboid-1 was 
observed; This cleavage was not sensitive to batimastat. 
This shows that Ama-1 has a rhomboid-cleavable TMD. 

Cleavage of Pf adhesion protein CTRP 

Cleavage of the CTRP TMD by Drosophila Rhomboid-1 was also 
determined using the methods described above.. 

Briefly, Drosophila Rhomboid-1 was co-expressed as 
described above in COS cells with a chimeric protein 
comprising (from N- to C- terminus) the signal peptide of 
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Spitz, followed by GFP, followed by the Pf CTRP 
transmembrane domain, followed by the cytoplasmic domain of 
TGFalpha (i.e. the CTRP TMD with heterologous extracellular 
and cytoplasmic domains) . The CMV promoter was used to 
5 • express both constructs. 

Cleavage of . the CTRP TMD was assayed by the standard 
technique described above. If the CTRP TMD is cleaved by 
the Rhomboid protein, GFP accumulates in the medium of the 
10 transfected cells in a batimastat-insensitive manner. 

Accumulated GFP is then detected with an anti-GFP antibody 
(Santa Cruz Biologicals) as described above. 

It was observed that the TMD from Pf adhesion protein CTRP 
15 is cleaved efficiently by Drosophila rhomboid-1. 

Expression of Rhomboid in Plasmodium 

Pairs of amplification primers, specific for each P. a 
falciparum Rhomboid were designed using conventional primer 
20 design software. The expression of Plasmodium Rhomboid 

proteins was analysed by performing PGR on cDNA prepared 
from RNA isolated from the blood stage of Plasmodium 
falciparum using standard techniques. 

25 Rhomboids -1, -3, -4 and -6 are all observed to be 
expressed in blood stage Plasmodium parasites. 

Plasmodium Rhomboids are therefore present at the blood 
cell invasion stage of the parasitic life-cycle and may 
30 play a key role in mediating this invasion. 
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■ In vivo role o£ Rho mboids in Infectivitv of Parasites 

Mutant strains of P. falciparum are generated in which the 
candidate rhomboid protein is knocked out. The 
infectiousness of- the mutant parasite is then determined. 

The knockout is done either by standard .targeted gene 
disruption {Sultan et al Cell 90 511) or, more 

conveniently, by RNA interference (RNAi) , as described 
below. 

100 ug of RNA corresponding to each candidate Rhomboid gene 
is synthesized by in vitro transcription from 5 ug 
linearized plasmid templates according to. manufacturer's 
instructions (Promega Ribomax system) . The resulting RNA 
is purified using the RNeasy protocol (Qiagen) , denatured 
by boiling, and annealed in ImM Tris-HCl pH7.4, ImM EDTA 
overnight. The resulting dsRNA is then added to a culture 
of P. falciparum sporozoites (Malhotra et al Mol . Microb. 
45 1245-1254) . 

The reduced infectivity or gliding behaviour of the 
-knockout parasites relative to the wild-type controls 
confirms that rhomboid regulates the infectious process. 

Impairment of the proteolysis of the adhesive micronemal 
proteins may also be established in the rhomboid mutant (or 
RNAi ) cells, by determining the production of cleaved 
product by Western blotting using anti-TRAP antibodies 
(Sharma et al Infection & Immunity 64 2172-2179, Gantt et 
al Infection & Immunity 68 3667-3673) . 
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anger 


FullPhat 


GlimmerM 


Genef inder 


Chromosome 


Approx 
location 


Motif 


Rhomboid 


FB03040C 


chr5.phat_92 


chr5 . glm_74 


chr5 . gen_274 


5 


289kb 


GSSG 


Rhol 


FE0755C 


chr5.phat_172 


chr5 .glm_161 


chr5 .gen_236 


5 


624kb 


GASG 


Rho2 (mite 


Fll_0150 


chrll.phat_164 


chrll.glm_163 


Not predicted 


11 


538kb 


GAST 


Rho3' 


F13_0312 


chrl3.phat_576 


chrl3.glm_633 


chrl3.gen_395 


13 


2.22Mb 


GSSS 


Rho5 (mite 


F13_0241 


chrl3.phat_452 


chrl3.glm_4 94 


chrl3.gen_4 56 


13 


1.73Mb 


GASG 


Rho4 


F14_0110 


chrl4 .phat_108 


Not predicted 


chrl4 .gen_705 


14 


453kb 


SSSS 


Rho6 (?) 


AL8P1.16 


chr8.phat_47 


chr8 .glm_34 


chrB .gen_290 


8 


135kb 


GAST 


Rho7 . 



Table la 



Toxoplasma gondii 
consensus ID 


Genbank accession 


Motif 


Rhomboid 


Ctoxoqual_33 60 


AA531635 


GAST 


TgRhol 



Table lb 
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Protein 


Species 


Accession 
no . 


Pf TRAP 


Plasmodium 
falciparum 


U67764 


PfCTRP 


Plasmodium 
falciparum 


U34363 


PfAMAl 


Plasmodium 
falciparum 


AF277003 


EBL-175 


Plasmodium 
falciparum 


AAM33518 


MAEBL 


Plasmodium 
falciparum 


AY042084 


TgMIC2 


Toxoplasma 
gondii 


TGU62660 


TgMIC6 


Toxoplasma 
gondii 


AF110270 


TgMIC12 


Toxoplasma 
gondii 


Opitz et 
al. 


TgAMAl . 


Toxoplasma 
gondii 


AF010264 


PbTRAP 


Plasmodium 
berghei 


U67763 


EtMICl 


Eimeria 
tenella 


AF032905 


EtMIC4 


Eimeria 
tenella 


CAC34726 


Sm7 0 


Sarcocystis 
muris 


AAK35069 



Table 2 
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Claims 



10 



15 



20 



25 



1. An method for identifying and/or obtaining a compound 
which inhibits infectivity of a protozoan pathogen, which 



(a) contacting an isolated Rhomboid polypeptide and an 
isolated substrate polypeptide in the presence of a test 
compound; and 

(b) determining proteolytic cleavage of the substrate 
protein. 

2. A method according to claim 1 wherein the protozoan 
pathogen is an apicomplexan pathogen. 

3. A method according to claim 2 wherein the apicomplexan 
pathogen is selected from the group consisting of 
Plasmodium, Toxoplasma, Eimeria, Sarcocystis, Cyclospora, 
Isospora, Cryptosporidium, Babesia and Theileria. 

4 . A method according to any one of the preceding claims 
wherein the Rhomboid polypeptide is a protozoan Rhomboid 
protein 

5. A method according to claim 4 wherein the Rhomboid 
polypeptide is encoded by a nucleic acid sequence shown in 
Table 1. 

6. A method according to any one of the preceding claims 
wherein the substrate polypeptide comprises a lumenal 
domain and a TMD, . the TMD having a region proximal to the 
lumenal domain which comprises one or more of residues 138- 
144 of the Drosdphila Spitz sequence (ASIASGA) . 



method comprises: 
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7. A method according to claim 6 wherein the substrate 
polypeptide comprises a TMD and a lumenal domain, the TMD 
having a region proximal to a lumenal domain which has the 
sequence of residues 138-144 of Drosophila Spitz. 

8. A method according to claim 6 wherein the substrate 
polypeptide is an adhesive micronemal polypeptide. 

9. An assay method according to claim 8 wherein the 
substrate polypeptide is encoded by a nucleic acid sequence 
shown in Table 2. 

10. An assay method according to claim 9 wherein the 
substrate polypeptide is Ama-1 or CTRP. 

11. A method according to any one of the preceding claims 
wherein the substrate polypeptide and the Rhomboid 
polypeptide .comprise ER (endoplasmic reticulum) retention 
signals. 

12. A method according to claim 10 wherein the endoplasmic 
reticulum retention signals are KDEL or KKXX. 

13. A method according to any one of the preceding claims 
wherein the substrate polypeptide comprises an 
extracellular domain having a detectable label. 

14. A method according to claim 13 wherein the detectable 
label is GFP . 



15. A method according to any one of the preceding claims 
wherein said Rhomboid polypeptide and said substrate 
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polypeptide are expressed in a host cell from heterogeneous 
nucleic acid. 

16. A method according to any one of the preceding claims 
comprising the further step of; 

(c) bringing into contact an isolated human Rhomboid 
polypeptide and a polypeptide substrate in the presence of 
the test compound; and, ■ 

(d) -- determining proteolytic cleavage of the substrate 
by the human Rhomboid polypeptide. 

17 . A method according, to any one of the preceding claims 
comprising identifying said test compound as a modulator of 
adhesive micronemal polypeptide cleavage. 

18. A method according to claim 17 further comprising, 
determining the ability of said test compound to inhibit 
the invasiveness of a protozoan pathogen. 

19. A method according to claim 17 or claim 18 comprising 
isolating said test compound. 

20. A method according to claim 19 comprising synthesising 
the test compound. 

21. A method according to claim 19 comprising modifying 
the test compound to optimise its pharmacological 
properties. 

22. A method according to any one of claims 17 to 21 
comprising formulating said test compound in a 
pharmaceutical composition with a pharmaceutically 
acceptable excipient, vehicle or carrier. 
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23. A compound which modulates protozoan pathogen 
infectivity obtained by a method of any one of claims 1 to 
18. 

24. A compound according to claim 23 comprising a 
peptide fragment of a protozoan Rhomboid polypeptide. 

25. A method of' producing a pharmaceutical composition 
comprising; 

identifying a compound which inhibits the infectivity 
of a protozoan pathogen using a method according to any one 
of claims 1 to 18; and, 

admixing the compound identified thereby with a 
pharmaceutically acceptable carrier. 

26. A method according to claim 25 . comprising the step of 
modifying the compound to optimise the pharmaceutical 

. properties thereof. 

27. A method for. preparing a pharmaceutical composition . 
for treating a protozoan pathogen infection comprising; 

i) identifying a . compound which' modulates the proteolytic 
activity of a Rhomboid polypeptide, 

ii) synthesising the identified compound, and; 

iii) incorporating the compound into a pharmaceutical 
composition . 

28. A pharmaceutical composition comprising a compound 
according to claim 23 or claim 24. 
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29. Use of a compound according to claim 23 or claim 24 in 
the manufacture of a composition for treatment of a 
protozoan pathogen infection. 

5 30. A method comprising administration of a composition 
according to claim 23 or claim 24 to a patient for 
treatment of a protozoan pathogen infection. 

31. A method according to claim 30 wherein the protozoan 
10 pathogen is an apicomplexan pathogen selected from the 
group consisting of Plasmodium, Babesia, Theileria, 
Toxoplasma, Eimeria, Sarcocystis, Cyclospora, Isospora and 
Cryptosporidium. 

15 32 . A method of identifying a protozoan Rhomboid 
polypeptide comprising; 

(a) providing a test protozoan Rhomboid polypeptide, 

(b) bringing into contact a substrate polypeptide and the 
test Rhomboid polypeptide under conditions in which the 

20 substrate polypeptide is normally proteolytically cleaved; 
and, 

(c) determining cleavage of the substrate polypeptide. 

33. A method according to claim 32 wherein the test 
25 Rhomboid polypeptide comprises an amino acid sequence 

encoded by a nucleic acid sequence shown in Table 1. 

34. A method according to claim 32 or claim 33 wherein the 
substrate polypeptide comprises the lumenal region, of the 

30 TMD of Spitz, Gurken, Keren, Ama-1 or CTRP. 

35. A method according to any one of claims" 32 to 34 
wherein the substrate polypeptide comprises an amino acid 
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sequence encoded by a nucleic acid sequence shown in Table 
2 . 



36. A method according to claim 35 wherein the substrate 
polypeptide is Ama-1 or CTRP. 
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EtMIC4 


...GFPTAAVAGGVGGVLLLAAVGGGV- 


-AAY 




TgMIC2 


...SGIAGAIAGGVIGGLILLGAAG 


GASY 




TgMIC6 


...SGHAGAIAGGVIGGLLLLSAAGAG- 


-VAY 




TgMIC12 


...GVPVAAIAGGWGGVLLIAGGAGA- 


-AVY 




PbTRAP 


...SNNGI KIAGGIIGGLAI IGCIG 


-VGY 


10 


Pf TRAP 


...S DNKYKIAGGIAGGLALLACAG 


-LAY 




EtMICl 


...GFPTAAVAGGVAGGVLAIAAGAG — 


-AFY 




Sm7 0 


...GMPTAAIAGGIVGGVLLLGAAGGG- 


-AAY 




TgAMAl 


...GSNTALIAGLAVGGVLLLALLGG — 


-GCY 




DmSPITZ 


...PRPMLEKASIASGAMCALVFMLFVCLAFYL 


15 


DmKEREN 
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